On Multi-view Learning with Additive Models by Mark Culp,
نویسندگان
چکیده
In many scientific settings data can be naturally partitioned into variable groupings called views. Common examples include environmental (1st view) and genetic information (2nd view) in ecological applications, chemical (1st view) and biological (2nd view) data in drug discovery. Multi-view data also occur in text analysis and proteomics applications where one view consists of a graph with observations as the vertices and a weighted measure of pairwise similarity between observations as the edges. Further, in several of these applications the observations can be partitioned into two sets, one where the response is observed (labeled) and the other where the response is not (unlabeled). The problem for simultaneously addressing viewed data and incorporating unlabeled observations in training is referred to as multiview transductive learning. In this work we introduce and study a comprehensive generalized fixed point additive modeling framework for multi-view transductive learning, where any view is represented by a linear smoother. The problem of view selection is discussed using a generalized Akaike Information Criterion, which provides an approach for testing the contribution of each view. An efficient implementation is provided for fitting these models with both backfitting and local-scoring type algorithms adjusted to semisupervised graph-based learning. The proposed technique is assessed on both synthetic and real data sets and is shown to be competitive to state-of-the-art co-training and graph-based techniques.
منابع مشابه
On Propagated Scoring for Semi-supervised Additive Models
In this paper, a semi-supervised modeling framework that combines feature-based (x) data and graph-based (G) data for classification/regression of the response Y is presented. In this semi-supervised setting, Y is observed for a subset of the observations (labeled) and missing for the remainder (unlabeled). The Propagated Scoring algorithm proposed for fitting this model is a semi-supervised fi...
متن کاملAn Iterative Algorithm for Extending Learners to a Semi-supervised Setting
In this paper, we present an iterative self-training algorithm, whose objective is to extend learners from a supervised setting into a semi-supervised setting. The algorithm is based on using the predicted values for observations where the response is missing (unlabeled data) and then incorporates the predictions appropriately at subsequent stages. Convergence properties of the algorithm are in...
متن کاملOn Multi - View Learning with Additive Models
In many scientific settings data can be naturally partitioned into variable groupings called views. Common examples include environmental (1st view) and genetic information (2nd view) in ecological applications, chemical (1st view) and biological (2nd view) data in drug discovery. Multi-view data also occur in text analysis and proteomics applications where one view consists of a graph with obs...
متن کاملHeritabilities and Genetic Correlations for Egg Weight Traits in Iranian Fowl by Multi Trait and Random Regression Models
Objective: The main objective of this research was estimation of genetic parameters for five consecutive measurements of egg weights in Isfahan fowl using multi trait model and random regression models. Methods: The statistical models included generation-hatch as a fixed effect, weeks of age as a covariate and additive genetic and individual permanent environmental effects as random effects. Th...
متن کاملHeritabilities and Genetic Correlations for Egg Weight Traits in Iranian Fowl by Multi Trait and Random Regression Models
Objective: The main objective of this research was estimation of genetic parameters for five consecutive measurements of egg weights in Isfahan fowl using multi trait model and random regression models. Methods: The statistical models included generation-hatch as a fixed effect, weeks of age as a covariate and additive genetic and individual permanent environmental effects as random effects. Th...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009